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ABSTRACT 

We derive asymptotics of moments and identify limiting distributions, under the 
random permutation model on m-ary search trees, for functionals that satisfy recurrence 
relations of a simple additive form. Many important functionals including the space 
requirement, internal path length, and the so-called shape functional fall under this 
framework. The approach is based on establishing transfer theorems that link the 
order of growth of the input into a particular (deterministic) recurrence to the order of 
growth of the output. The transfer theorems are used in conjunction with the method of 
moments to establish limit laws. It is shown that (i) for small toll sequences (t n ) [roughly, 
t n = (^(n 1 / 2 )] we have asymptotic normality if m < 26 and typically periodic behavior 
if to > 27; (ii) for moderate toll sequences [roughly, t n = uj{n 1 / 2 ) but t n = o(n)] we 
have convergence to non-normal distributions if to < too (where too > 26) and typically 
periodic behavior if to > too + 1; and (iii) for large toll sequences [roughly, t n = u>(n)} 
we have convergence to non-normal distributions for all values of to. 
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1 Background and notation 

We start by giving a brief overview of search trees which are fundamental data structures 
in computer science used in searching and sorting. For integer m > 2, the m-ary search 
tree, or multiway tree, generalizes the binary search tree. The quantity m is called the 
branching factor. According to , search trees of branching factors higher than 2 were 
first suggested by Muntz and Uzgalis |2J "to solve internal memory problems with large 
quantities of data." For more background we refer the reader to and [T7] . 

An m-ary tree is a rooted tree with at most m "children" for each node (vertex), each 
child of a node being distinguished as one of m possible types. Recursively expressed, an 
m-ary tree either is empty or consists of a distinguished node (called the root) together 
with an ordered m-tuple of subtrees, each of which is an m-ary tree. 

An m-ary search tree is an m-ary tree in which each node has the capacity to 
contain m — 1 elements of some linearly ordered set, called the set of keys. In typical 
implementations of m-ary search trees, the keys at each node are stored in increasing 
order and at each node one has m pointers to the subtrees. By spreading the input data 
in m directions instead of only 2, as is the case for a binary search tree, one seeks to 
have shorter path lengths and thus quicker searches. 

We consider the space of m-ary search trees on n keys, and assume that the keys 
are linearly ordered. Hence, without loss of generality, we can take the set of keys to be 
[n] := {1, 2, . . . , n}. We construct an m-ary search tree from a sequence s of n distinct 
keys in the following way: 

(i) If n < m, then all the keys are stored in the root node in increasing order. 

(ii) If n > m, then the first m — 1 keys in the sequence are stored in the root in 
increasing order, and the remaining n — (m — 1) keys are stored in the subtrees 
subject to the condition that if o\ < 02 < • • • < <T m -l denotes the ordered sequence 
of keys in the root, then the keys in the jth subtree are those that lie between 
o-j-X and <jj, where o"o := and a m := n + 1, sequenced as in s. 



(iii) All the subtrees are m-ary search trees that satisfy conditions |(i)[ |(ii)[ and (iii) 

It is our goal to study additive functionals (see Definition ll.ljl defined on m-ary 
search trees. Such functionals represent the cost of divide-and-conquer algorithms, 
reflecting the inherent recursive nature of the algorithms. 

Let T be an m-ary search tree. We use \T\ to denote the number of keys in T. Call 
a node full if it contains m — 1 keys. For 1 < j < m, let Lj(T) denote the jth subtree 
pendant from the root of T. For a node x in T, write T x for the subtree of T consisting 
of x and its descendants, with x as root. This notation is illustrated in Figure ^ 

Definition 1.1. Fix m > 2. We will call a functional / on m-ary search trees an 
additive tree functional if it satisfies the recurrence 

m 

f(T) = J2f(Li(T))+t m , (1.1) 
i=i 

for any tree T with \T\ > m — 1. Here (t n )n>m-l is a given sequence, henceforth called 
the toll sequence or toll function. 
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Figure 1: Example of notation for a quaternary tree T: here, \T\ = 15 and T x = L\. 



Note that the recurrence (jl.lj) does not make any reference to t n for < n < m — 2 
nor specify f(T) for < |T| < m — 2. 

Several interesting examples can be cast as additive functionals. 

Example 1.2. If we specify f(T) arbitrarily for < \T\ < m — 2 and take t n = c for n > 
m — 1, we obtain the "additive functional" framework of |171 §3.1]. (Our definition 
of an additive functional generalizes this notion.) In particular if we define /(0) := 
and f{T) := 1 for the unique m-ary search tree T on n keys for 1 < n < m — 2 and let 
t n = 1 for n > m — 1, then /(T) counts the number of nodes in T and thus gives the 
space requirement functional discussed in [171 §3.4]. 

Example 1.3. If we define f(T) := when < \T\ < m — 2 and t n := n — {m — 1) for 
n > m — 1 then / is the internal path length functional discussed in [171 §3.5]: f(T) is 
the sum of all root-to- key distances in T. 

Example 1.4. As described above, each permutation of [n] gives rise to an m-ary search 
tree. Suppose we place the uniform distribution on such permutations. This induces 
a distribution on m-ary search trees called the random permutation model. Denote its 
probability mass function by Q. It is important to note that Q is not uniform, since 
different permutations can give rise to the same tree. For example, the permutations 

(10, 7, 12, 4, 1, 8, 5, 6, 9, 14, 11, 2, 15, 13, 3) 

and 

(7, 10, 12, 1,4, 8, 5, 6, 9, 14, 11, 2, 15, 13, 3) 

both give rise to the quaternary search tree shown in Figure ^ Dobrow and Fill [S] 
noted that 

^rukr (L2) 



where the product in 1)1.2)1 is over all full nodes in T. This functional is sometimes called 
the shape functional as it serves as a crude measure of the "shape" of the tree, with 
"full" trees (like the complete tree) achieving larger values of Q. For further discussions 
along these lines, consult 8 and 9 . If we define f(T) := for < \T\ < m — 2 and 
t n := In ("j) for n > m — 1, then f(T) = — lnQ(T). In this work we will refer to 



3 



— InQ (rather than Q) as the shape functional. It was our desire to understand the 
distribution of the shape functional under the random permutation model that led to 
this paper. 

We study the distribution of the functional f(T) when T is given the distribution Q 
described in Example 11.41 To do this, we derive asymptotics for the moments of f(T) 
and then employ the method of moments. 

Remark. In the sequel (without any loss of applicability) we will restrict attention to 
real- valued toll sequences. 

Related work 

Chern and Hwang [2] carried out the program of this paper for the space requirement 
( Example 11.2(1 and independently discovered Theorem 12.41 5, Proposition 7]. Hwang 
and Neininger Jl] analyzed a range of (not necessarily deterministic) toll functions 
for binary search trees. In this paper we treat m-ary search trees for any m > 2, 
and for simplicity we restrict attention to deterministic toll sequences. Another closely 
related paper is where the authors consider variants of the Quicksort algorithm by 
allowing more general schemes of choosing the pivot. As in our Section \2. 21 their paper 
treats Cauchy-Euler differential equations, and a generalization of our Theorem 12.41 is 
obtained Theorem 1]. 

We obtain moment asymptotics for our additive functionals using the Asymptotic 
Transfer Theorem of Section 12.21 An alternative approach employs singularity analy- 
sis ^2] of generating functions. A sketch of this approach in the case of binary search 
trees (m = 2) is presented in jlUj . One small advantage of the present approach is that 
the conditions we impose on the toll sequence [for example, (|2.11|) ] are milder than those 
required for the application of singularity analysis. 

2 Overview: main results 

For a given toll sequence, the distribution of f(T) depends only on n. We let X n denote 
a random variable whose distribution is that of f(T) under the random permutation 
model on T. It is the distribution of X n which is the main focus of this paper. 

2.1 A common framework for all moments 

Under the random permutation model the joint distribution of the subtree sizes 
. . . , \ L m \ is uniform over all ( Ij) m-tuples of nonnegative integers that sum to 
n — (m — 1): see [TO Exercise 3.8]. We now apply the law of total expectation to com- 
pute n n (k) := EI* by conditioning on . . . , |£ m |)- Let J2j denote the sum over all 
m-tuples (ji, . . . , j m ) that sum to n — (m — 1) and ^ k the sum over all (m + l)-tuples 
) of nonnegative integers that sum to k. Then, letting © denote sums of 
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mutually independent random variables, for n > m — 1 we have 

Mn (A;) = EX* = EE(X* | . . . , \L m \) = £ E (X n © • • • © X jm + t n ) k 

\m-l) j 

\m-l) j k ^ ' ' ' ' "TO+l/ 

We can rewrite this equation as 

n-(m-l) - .\ 
JTi v — v in— 1 — 7\ 
Mn(*0 = rirT £ m 9 ftW+rnW, (2.1) 

U-iJ j=0 ^ m z / 

where 

»•»(*):=£*(* J fe )*n m+1 r i T E^^)---^(^)' ( 2 - 2 ) 

^ \ 1) • • • i mi m+1/ Vm— 1/ j 

with ^£ denoting the same sum as ^ k with the additional restriction that ki < k for 
i = 1, . . . , m. We have thus established that the moments ^ n (k) each satisfy the same 
basic recurrence in n, differing as k varies only in the non- homogeneous term r n (k). 
Observe that r n (l) = t n , the toll function. We record this important fact as 

Proposition 2.1. Under the random permutation model, all moments of an additive 
functional satisfy the basic recurrence 

n-(m-i) . A 
m ^—^ I n — 1 — 7 \ 
a n = b n + -—^ 2^ I m-2 J^'' n>m-l, (2.3) 

Vm— 1/ j=Q \ ' 



mi/i specified initial conditions (say) aj '■= bj, < j < m — 2. [Recall the statement 
following Definition \l.l\ about the initial conditions for the recurrence 

To be more specific, equation (j2.3|) is satisfied by a n = n n {k) = EI* and b n = r n (k), 
where r n {k) is defined in terms of lower-order moments of smaller trees at (|2.2|) . We 
proceed to study the recurrence relation (|2.3j) for general input (b n ) and corresponding 
output (a n ). 

2.2 Transfer theorems 

In order to analyze the recurrence relation (j2.3fl we introduce generating functions 

oo oo 

A(z) := a nZ n and B{z) := b n z n . 

n=0 n=0 

Furthermore, let x r := YYk=o( x + ^) denote the rth rising factorial power of x and x- := 
nfe=o(' E ~~ ^) the r th falling factorial power of x. Multiplying (|2.3jl by n m ~ l z n -( m ^ 1 ) 
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and summing over n > m — 1 we get, after some (routine) calculation, the differential 
equation 

A( m - 1 )( z ) = B^ m ~ l \z) + m!(l - z)-( m -^A(z). (2.4) 

Equations of the form ()2.4)) are members of a class known as Euler differential equations. 
Using the method of variation of parameters and combinatorial identities one can obtain 
the general solution to this equation. See Section |3] for a proof. 

Theorem 2.2 (Exact Transfer Theorem (ETT)). Let A and B denote the respective 
ordinary generating functions of the sequences (a n ) and (b n ) in the recurrence 1)2.3)) . Let 

m—2 oo 

B(z) := B{z) - bjz j = b n z n . (2.5) 

j=0 n=m—l 



Then 



m—l 



lib -L lit ± / ~ \ ^ . p Z 

A(z) = J2 Cj (l- z )-^ + J2 [ - "7/ / g (m " 1) (0(l-0 Aj+m - 2 dC (2-6) 

3=1 3=1 J ° 

m—l m—l /-. . „ z 

= Yl c 3d - ^)' Aj + B(z) + m! ^ 1 ~ Zj ' / J&(0(l-0^ -1 ^. ( 2 -7) 

3=1 3=1 1 jJ ° 
where if) is the indicial polynomial 



^(A) := A m_ - ml = A(A + 1) • • • (A + m - 2) - ml (2.8) 

mi/i roots 2 =: Ai, A2, . . . , A m _i in nonincreasing order of real parts. In 1)2.6)) . i/ie 
coefficients c\, C2, . . . , c m _i can 6e written explicitly in terms of the initial conditions 
bo, ■ ■ .,b m - 2 as 

. m—2 , . 
m\ v — v , fc! 



^& fc -=— i = l,...,m-l. (2.9) 



to 



In particular, 



m-2 



n - — - — y - — % — -. (2.10) 

Hm-lj^ (i + l)(j+2) 



The indicial polynomial 1)2. 8|) is well studied; see ^3 El OH and Appendix^! and also 
the related )19j . We will exploit the expression 1)2.6)1 for A(z) to relate the asymptotic 
properties of the sequence (b n ) to those of (a n ) and then use transfer theorems to derive 
limiting distributions of additive functionals. 

Remark 2.3. For computations, equation 1)2.6)1 might be easier to use when it is no 
bother to compute derivatives of B; otherwise, 1)2.7)1 is easier. Equation Q2.7)) will be 
used in establishing part (a) of the Asymptotic Transfer Theorem 12.41 the proof of 
part (b) will use 
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It is quite easy to transfer asymptotics for B to asymptotics for A using the ETT. We 
give three examples important for applications to moments of functionals in the next 
theorem, proved independently by Felix Chern and Hsien-Kuei Hwang in |Hj using a 
quite different approach. The series convergence required in (|2.11|) need not be absolute. 

Theorem 2.4 (Asymptotic Transfer Theorem (ATT)). 

(a) If 

b n = o(n) and S~] - — — £ — — converges, (2.11) 

^ [n + lHn + 2) 

n=0 

then 

K °° b 

a " = ^T n + o(n) ' where Jfi:= E ( 3 + i)(j + 2 ) - (2 - 12) 

(b) If b n = Kiin + 1) + h n where (h n ) satisfies (|2.11|) [with (b n ) replaced by (h n )[, 
then 

n ' 2 -nH n + ^ - n + o(n), (2.13) 



H m 1 H m 1 



where 



^(i + i)(i + 2) 



-f^m 1 j Hm J 



(2.14) 



(c) = K^ri" + o(n 1 ') wrei/i u > 1, i/ien 



a n = ^ n^ + oK). (2.15) 

r(i)+m) 

Of course part (a) is equally valid with Xl^=o a 1 replacing the series in (|2.11|) . 

Theorem 12.41 will be proved in Section 0J It is easy to see that the ratio ^^+1) 
appearing in (|2.15|) has modulus strictly less than unity; in particular, the expression 
given is well defined. Refined and additional asymptotic transfers will be discussed in 
Section 5. 

2.3 Limiting distributions 

Applications to moments and limiting distributions are discussed for "small" toll se- 
quences in Section [5] — see Theorems 15.11 and 15.41 "moderate" and "large" toll sequences 
are discussed in Section^ — see Theorems Ifi . 2 1 and Ifi . 51 We state here a summary theorem 
that can be deduced easily from these four theorems. [For the definition of mo{P) in 
case | (ii) | o f the theorem, see Remark 16. 31 Concerning the values of g\ and 52 in cases 
aadpjjj consult (|?T5|) . (Oil (and <P).] 

Theorem 2.5 (Limit theorem for additive functionals). Let X n = f(T n ) be the 
additive functional on random m-ary search trees corresponding to a toll sequence (t n ), 
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with T n having the distribution on trees induced by a uniformly distributed permutation 
on [n]. When (t n ) satisfies one of the three conditions (i) (iii) below, then 



X n — E A. 



W, 



with convergence of all moments. Here, with L denoting a slowly varying function, 

(i) If 2 < m < 26 and 

t 2 

(a) t n = o(y / n) and n max n s <Kjl -A < oo for some < 5 < 1 or 

(b) i n ~ y/nL(n), 

then W has the standard normal distribution. 

(ii) Ift n ~ n p L{n), with 1/2 < /3 < 1 and 2 < m < m Q ((5) , then W = g% 1/2 Y where 
C(Y) is the unique distribution with finite second moment satisfying (|6,1|) , i/ere 



52 



with 



(m- 1)! 

TO!r(2/3+lT 
r(2/3+m) 



(m- 1)! 



5i 



r(/? + i) 2 r 2 (/? + i) 

+ 2mgi - + m(m — 1)^ ■ 



•T(J3 + m) 
T{(3 + m) 



T(2/3 + m) 



>o, 

(2.16) 
(2.17) 



(iii) If t n ~ n@L(n) with (3 > 1, f/ien W = g 2 1 ^ 2 Y where C(Y) is the unique dis- 
tribution with finite second moment satisfying (|6.1|) . where g<i is again defined 

at dmnD - dnzD - 



Remark 2.6. In case | (iii)] it is easy to check that > and 52 > 0. In case (if 



one can 



verify easily that g\ < 0. In this case to see that 52 > 0, note that no constant random 
variable satisfies (|6.1|) . 



3 Proof of the ETT 

In this section we prove the ETT, which is Theorem 12.21 

Proof. For the proof of (|2.6|) . see Appendix El in particular Corollary IA.2I and Propo- 
sition's! To begin the proof of l|2.7|) . note that B can be replaced by B in (|2.6|) , We 
then use repeated integration by parts and invoke Identity IB.101 Denoting 

m—1 

A:=A(z)-^2c,(l-z)-^, 
3=1 

after m — 2 integrations by parts we find 

A ^ = E n y } ' (Aj + m — 2) ■ • • (Aj + 1) jf £'(0(1 - C) Aj dC- 
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But 

n x ib(\j)+m\ ml 
(A i + m -2)...(A i + l) = ^— = ^ = — , 

Aj Aj 

SO 

m— 1 . ,. z 

We obtain Q2.7JI by performing one more integration by parts and utilizing Identity IB. 91 
with A = 0. □ 



4 Proof of the ATT 

In this section we prove the ATT, which is Theorem 12,41 The following elementary 
result, which is found in the first line of the proof of Lemma 6 in [S], is key to the 
analysis of (|2.7jl . For completeness, we include a proof. 

Lemma 4.1. Let Y(z) = X^nLo^ ra2;n Ho = 0- F° r an V A G C ; 



l-z) 



(4.1) 

The product in (|4.1|) may be written (when A G C \ {0, —1, —2, . . .}) as 

T(A + ra)r(2 + A;) 
T{l + n)T{\ + k + iy 

which by Stirling's formula equals 

n"- 1 [l + O^- 1 )] 
(k + [l + O^fc + l)- 1 )] 1 • j 

for n > 1 and fe > 1. [The product in (|4.1|) equals (|4.2|) even if A G {0, -1, -2, . . .}.] 
Also, of special interest is the case A = 2, in which case (|4.1[) reduces to 

(l-,r^(l-C)y(C)dc)=(» + l)E ( , + 1 ^ + 2) . n>0. (4.3) 

Proof. The function W(z) := (1 — z)~ x J Q Z (1 — C) A1 ^(C) is the unique solution with 
W(0) = to the differential equation 

W'(z) = A(l - z)- l W{z) + (1 - z)- 1 ^^); 

that is, w n := [;z n ]W(.z), n > 0, satisfies wo = and 

^ n— 1 ^ n—l 

w n = - y^w k + - y^Vk, n>l. (4.4) 

fc=0 k=0 

But the recurrence (|4.4|) can be easily solved to yield (|4.1j) : compute the difference 
nw n — (n — l)w n -\ and iterate. □ 
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For part (a) of the ATT we use the following estimates from [5] ; these follow readily 
from (|4.1|) — (|4.$|) . [Part (a) is their Lemma 6; part (b) is used tacitly in the proof of 
their Proposition 7.] 

Lemma 4.2. 

(a) //Re(A) < 2 and Y(z) = Y2n°=oyn zn satisfies yo = and y n = o(n), then 



[z n ] ( (1 - J Y(0(l-() A - L d() =o{n). 



(b) With B defined at if (EOTI) holds, then 



o(n). □ 



For part (c) of the ATT, we will need the following simple comparison lemma. 

Lemma 4.3. // (b n ) and (b' n ) are two input sequences such that 

\°n\ <b' n for all n > 0, 

then the corresponding output sequences (a n ) and (a' n ) in Q2.3JI (with the initial condi- 
tions stated there) satisfy 

\ a n\ < o! n for all n > 0. 



Proof. This follows immediately by induction. 



□ 



Proof of Theorem \2.J\ (a) From (|2.7j) . assumption (|2.11|) . Lemma F4.21 (|2.1()|) . and ip'(2) 
ml(H m — 1), the result is immediate. 

(b) Suppose first that b n = n + 1. Then B(z) = (1 — z)~ 2 , so 

5 (m - 1} (z) =m!(l-z)- (m+1) . 

Plugging this into (j2.fi j) we find 

m— 1 



a„ = (n+1) 



ci 



+ m! 



1 



3=2 



(2-A i )^(A j ) 



+ ^) I*"] [(1 " z)- 2 log ((1 - zy 1 )] +o{n). 



Now we use Identities IBTTI and HTT21 Also, since b n = ti ~\~ 1 . we have from (|2. 10|) in 
this case that ci = 1. Therefore 



(n + 1) (1 + 5 



nH n + 



H m — 1 



(H m - 1)2 
1 1 



1 



+ 



+ 



1 



H m — 1 



[(n + l)H n -n] + o(n) 



H {2) - 1 



2 ff m - 1 2(# m - l) 2 



n + o(n). 
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This completes the proof of (b) for our special case, and the general case follows from 
this and part (a) using the superposition principle. 

(c) Suppose first that b n = (v+Vf/n). ~ n v /T(v+l), so that B(z) = (l-z)"^ 1 ) and 
B( m ~ 1 ')(z) = (v + l)™ i:T (l-zy ( - v+m \ Plugging this into (J22ID and utilizing Identity ESI 
with A = v + 1 and the calculation 



(v + 1) 



m—l 



we find 



A(z) 



{v + l)™" 1 

m\T(v + 1 



ml 



T(v + m) 



m\T{v + 1) 
T(v + m) 



(1- z)-^ +0(\l-z\- 2 ). 



By singularity analysis for large functions [12], this completes the proof of (c) for our 
special case. 

To complete the proof in the general case, we need only show that if b n = o{ri°) for 
v > 1, then a n = o{n v ). Indeed, fix e > 0; then there exists a sequence (b' n ) such that 
\b n \ < b' n for all n and 

b' n = e{y + 1) /re! for all large n. 

The toll sequence is but a slight modification of our special-case toll sequence, and we 
see that 



a' n = e'n v + o(n v ). 
Now Lemma 14.31 implies that 



where £ 



rc« + i) 



m\T(v + 1) 
T(v + m) 



limsup \a n \n v < e ; 



since e (and hence e') can be made arbitrarily small, this completes the proof. 



□ 



The conditions (|2.11|) on (b n ) are not only sufficient but also necessary for asymptotic 
linearity of a n . Indeed, here is a converse: 



Proposition 4.4. // a n = Kn + o(n) for some constant K, then (|2,11|) holds. 

Proof. Chern and Hwang 5 provide the simple proof that b n = o(n). Moreover, then, 
from (P77|) . (EHUj) . and (JOJ) with Y taken to be B, we find that 



n-l 

E 

j=0 



(i + i)(i + 2) 



iT(i? m -l)+o(l), 



i.e., that the series ^2^L (n+1 )( ra+2 ) converges [to i<C(# m - 1)]. 



□ 



The following additional asymptotic transfer results are established by calculations 
similar to those in the proof of the ATT. We leave detailed proofs as exercises for the 
reader. 
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Theorem 4.5 (more asymptotic transfers). Consider the initial value problem (]2.3[) . 

(a) If 2 < m < 26 and 6 n = o(i/n) ; i/ien we can refine (j2,12|) to 

tl m — 1 

(b) If Re X2 ='■ a < 1 + f3 and b n ~ n^L{n), where 1/2 < /3 < 1, u>ito, L slowly 
varying, then we can refine (j2.12j) to 



^1 

-n 



m—l 



==n p L(n) + o (n p L(n)\ . (4.6) 



i?m-l m!-(l+/3)' 
(c) If b n ~ nL(n) with L slowly varying, then, with K\ defined at ()2.12j) . 



_^_ nj if YT ^ < x . 



a « ~ \ 1 ^ L(fc) - f ^00 L{k) I 4 ' 7 ) 



(d) Pari (c) of the ATT can 6e extended as follows. If b n = K^n v L{n) +o(ri"L{n)) 
with v > 1 and L slowly varying, then 

a n = Jr ( , +1) n"L(n) + o (n"L(n)) . (4.8) 

f V(v+m) 

Proof hints. Whenever the conditions (|2,11|) are met we have by the ETT and (j4.3j) 

K 1 00 i 

°»-ft^T<" + 1 ) = °'"°- 1 ' + t °-^i'" + 1 )S ( , + 1) ^ + 2) («) 

fe=n 

m-1 , z v 

+™ ! E^"i (a-.)-* / *K)Ci-0*-'«), 



where a is strictly smaller than 1 + (3 by assumption. (When /? = 1/2 we know that 
a < 3/2 when m < 26.) Simple estimates, including the use of (|4.2j) . give cases (a) 
and (b); for (b), the coefficient of n^L{n) in ()4.6j) indeed is, using IdentitiesllT^and lBTTl 



1 1 
1 l rn I > 



m—l 



, 2 v-j) m!-(l + /3)— 1' 

In case (c), from the ETT result (|2.7|) and simple estimates we find 

1 n— 1 6 
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To finish, we use the regular variation fact (quoted by Hwang and Neininger ^1] at 
their equation (7) from Proposition 1.5.9a in [2]) that 

*w = ° (e ^f 1 ) • ( 4 - 10 ) 

\fc<n / 

In case (d), again from (|2.7|) and simple estimates we find 

m— 1 

E y (A . )(u + i_ A .) 

The proof is completed by using Identity El □ 



m— 1 j 

a n = 0{n) +b n + K A n v L{n)m\ V -r + o (n v L(n)) . 

r-f # A,- u + 1 - A,-) 



5 Asymptotic normality for small toll functions 

In this section we establish asymptotic normality for "small" toll functions. 
5.1 Central limit theorem statements 

As did Hwang and Neininger in ,145 , in our Theorems 15.11 and 15.41 we treat two over- 
lapping cases. Throughout, we write Y2j as shorthand for the sum over m-tuples 
(ji) • • • ,jm) of nonnegative integers summing to n — (m — 1). 

Theorem 5.1 (CLT I for small toll functions). If2< rn < 26 and the toll sequence 
(t n ) satisfies 

(a) t n = o (y/n) and (b) n~ l max -y < oo for some < S < 1, (5.1) 

— ' n s <k<n K 

then the mean /x n and variance of the corresponding additive functional X n on m-ary 
search trees with the random permutation model satisfy, respectively, 



= ~lj T n + W™) =■ fJ-n + o [y/n) , (5.2) 

tl m — 1 

with K\ defined at (|2.12l) . and 

1 °° r 

al = a\ + o(n), where a* := ^— y g ( . + ^ + 2) , (5.3) 

mf/i the sequence (r n ) defined by rj := for < j < m — 2 and 

r n : = [*" + Z^ii + ' ' ' + - /^n] 2 , n > m - 1. (5.4) 

lm-1/ j 

Moreover, 

— is asymptotically N(0, a ), 

and there is convergence of moments of every order. 
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Remark 5.2. One can check (for any 2 < m < oo) that the variance a 2 , vanishes for all 
n > m — 1 if and only if the toll sequence is chosen as 

t n = t min{m — 1, n}, n>0 

for some constant t € R. In that case without loss of generality t = 1 and then X n = n 
is just the number of keys and we have exact (though degenerate) normality. So we 
shall assume in the proof of Theorem 15.11 that a 2 > 0. 

Remark 5.3. (a) Condition (|5.1|) (b) trivially implies 

00 f 2 

£^<oo, (5.5) 

which in turn implies that (|2.11|) holds with absolute convergence; indeed, since the 
nonnegative numbers [(n + l)(n + 2)] , n > 0, sum to unity, we have 



E 



o ( n + l)(n + 2) 



00 f 2 

<V 7 ^t<oo. (5.6) 



(b) If 

t °° i 2 

\t n \=0(t n ) with0<-^| and V^<oo, (5.7) 

then we claim that ()5.1|) holds, and then as a corollary 



t 00 +2 

-===10 and Y -%< 



implies (|5.1|) . To see the claim, first observe that the condition (|5.7[l certainly im- 
plies (|5.1j) (a): moreover, we observe that the series (say, over 2 < n < oo) in (|5.1|) (b) is 
bounded by a constant times 



oo ~ 2 00 t ? gl oo 72 / » 

max fc - V n" 1 1 - 1 - V k V n" 1 = O V — 

n=2 n=2 1 1 k=2 (fc-l)V« <n<fc 1 /<5 \fc=2 



< OO. 



(c) One can check that, when m = 2, the proof we will give of CLT I requires 
only (|5.1|) (a) and ([5.5)1 . In that case we obtain a strengthening of "Case SI" of Theorem 2 
in ^3] (for deterministic toll sequences); they required t n = O (y / n/(logn)( 1 / 2 ) +£ ) for 
some e > 0. 

Theorem 5.4 (CLT II for small toll functions). If 2 < m < 26 and the toll sequence 
(t n ) satisfies 

t n ~ y/nL(n) (5.8) 

with L slowly varying, then the mean [x n of the corresponding additive functional X n on 
m-ary search trees with the random permutation model satisfies 



= H^i n - m! ~ (3/2)^ ^<") + ° (^ L '"» • < 5 - 9 > 
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with K\ defined at (|2.12|) . If ^2°° L ^ < 00 > ^ en ^ e variance a 2 satisfies (|5.3[) - ([5.4j) 

and we define 



s 2 (n) := <7 2 n. 



L 2 (fc) 



00, i/ien 



2 2/ \ 2 
<7 n ~ s (nj := ex n 



k<n 



L 2 (k) 



where in this case we define 



a 2 :- 



((|) m_1 ) 2 - 1) + 1] - H) 2 



Moreover, in either case 

X n - \xn 
s(n) 



(H m -1) m l-a)m-l 



is asymptotically standard normal, 



(5.10) 



(5.11) 



and there is convergence of moments of every order. 

Remark 5.5. When m = 2 the constant a 2 in Q5.11[) equals |vr— 14, and Theorem 15 . 41 re- 
duces to "Case S2" of Theorem 2 in ^1] (for deterministic toll sequences): see especially 
their displays (15) and (17), with T2 = 1. 



5.2 Central limit theorem proofs 

Proof of CLT I (Theorem \5.1\) . We use the method of moments together with asymp- 
totic transfer results. 

Given the toll sequence (t n ) defining the sequence (X n ) of random functionals of 
interest, the means (/z n ) satisfy the basic recurrence relation (j2.3|) with (b n ) replaced by 
(t n ). Thus (|5.2j) simply repeats the asymptotic transfer result (|4.5[) . 

According to the law of total variance, the sequence (<r 2 ) of variances also satisfies 
the recurrence ()2.3|) . but with (b n ) replaced by (r n ). According to Lemma l5~B1 to follow, 
the sequence (rn) satisfies the conditions (|2. 11|) . [Note: When m — 2, only (|5. lj) (a) 
and (|5.5|) . not (|5.1[) fb). are needed in the proof of Lemma IS~o1 ] Then ()5.3j) is immediate 
from part (a) of the ATT. 

Let X n := X n — fj,(n + 1) for n > 0. We will complete the proof of CLT I by showing 
by induction on k that 

fin(k) :=ElJ, k > 1 [with /} n (0) := 1] (5.12) 

satisfies 

An (2 A;) ~ ^° 2k n k , k > 1 (5.13) 

and 

fi n (2k - 1) = (n k ~W 2 A , fc > 1. (5.14) 
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Observe that (f5~2|) - ((5~3|) imply that (fS~T3|) and (|S~l"4l both hold for k = 1. 

The key to the induction step for both (|5.13|) and ()5.14|) is to apply the law of 
total expectation to IJ5.12J) . by conditioning on the subtree sizes \Li\, . . . , \ L m \ (recall 
the notation in Section [J) . In a manner similar to (|2.1|) , we have 

n-(m-l) , _ ^ _ \ 

An(*0 = T^T E \ _, 3 )Hk)+r n {k), (5.15) 
\m-l) j=o V m ' 

where 

rn(k):=jy( h I h )^ m+1 XrirE^^)---^(^) (5.16) 
k . . . , n, m , K m +iy \m-l) j 

with ^£ here denoting the same sum as Y^ k with the additional restriction that ki < k 
for i = 1, . . . , m. Observe that (|5.15j) is again of the basic form (|2..3|) , We will apply the 
ATT after evaluating r n {k) asymptotically. 

We will treat the induction step in detail only for ()5.13j) . the case (j5.14|) being 
similar and somewhat easier. Let Ylk denote the sum over m-tuples (ki, ■ ■ ■ , k m ) of 
nonnegative integers, each < k, summing to k (i.e., the same sum as Ylu with the 
additional restriction that k m+ i = 0). For k > 2 we clearly have, by induction [recall 

r n (2k) = o(n k ) + L ^ 2k ) T ^rT^»n(2h)---^ m (2km) 

k \ 1) • • • j raj \ m __i) j 



k v--m • • • ' Vm-lV 



But 



Jl \ ( 3m 



, , , v n / \ n 

Vm-i; j v / v 

fcm-1 /-i _ \fc 



(m - 1)! y xl 1 ■ ■ ■ x™_l (1 - xi x m -i) km dxf- dx m -i 

r(fci + l)...r(Av» + l) 1 



(m- 1)!- 



\fci,...,fe m / V to— 1 / 



where the above integral is over (x\, . . . , x m -%) G [0, l] m 1 with sum not exceeding 
unity. Since the number of terms in Y^£* is ( k ^^~[ l ) — m, we therefore have 

(2*0 ' 2fc fc^m-T 1 ) ~™ , fex 
V TO— 1 / 



( 2A ! 2fc fc 

2 k kl 



mlT(k + 1) 

r(fc + m) 



+ o(n fc ), A;>2. 
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Similarly, 

r n {2k - 1) = o (n k -^ 2 ^ , k>2. 
Now part (c) of the ATT implies Q5.13[) and (|5.14|) . □ 

The following lemma lies at the heart of the proof of Theorem 15.11 

Lemma 5.6. In the context of CLT I, the sequence (r n ) defined at (|5.4|) satisfies the 
conditions (|2.11j) . 

Proof. Clearly 



ITT [*n + An H + / i j m -An] 2 , n>m-l, 



\m— 1/ 



with 



An := - n(n + 1) 
which is o(y/n) by (|5.2|) . Recall the inequality 



,i=l 



< 



i=i 



for real numbers £i, • • ■ , Applying this to (|5.17|1 . 

n— (m— 1) 



9 n m \ ^ 



j=0 



n-l-j 
m — 2 



(5.17) 
(5.18) 

(5.19) 
(5.20) 



m + 2 

from which (|2.11j) fa) for (r n ) is evident. 

To establish the summability of r n /n 2 we need only establish that of fl^/n 2 . Indeed 
we can then use (|5.2Uj) again, together with Q5.5|) and the estimate 



£ 



-2 



n-(m-l) , A 

" (~\) ^ V rn-2 ri 

1 Vm-17 j=Q \ / 

m(m - 1) 2^ ^ X 



j'=0 n=j+m- 

OO OO 



_ x n 2 (n) m _i 



,7=0 n=j+m— 1 



£ 



A? 



< OO. 



To establish the summability of jl^/n 2 , we recall from 1)4 . 9 j) and (|4.2|) that 



Mr! 



-. oo 



m— 1 



L in 

n-1 



(k + l)(k + 2) 



(5.21) 



i=2 



fc=0 



14 1 

(& + l) ( 
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writing ay := ReAj (with a = ai < 3/2, since m < 26). Using l|5.19|) . we need only 
establish the summability of n~ 2 times the square of each of the four terms on the 
right in (|5.21jl . The first of these verifications is trivial, and the second was carried out 
at H5.5j) , For the third we apply the Cauchy-Schwarz inequality [compare H5.6|) ] to give 



E 

.k=n 



(fc + !)(*; + 2) 



whence 



< 



n 



E 

.k=n 



Vk \t k \ 



k=n 



k=n 



E 



E 

.k=n 



(k + l)(k + 2) 



k=n 



o(E^ 1/2 ]=o(E|)< 



by (|5.5|) again. 

We pause here to note that when m = 2 the proof is finished here, and that up to 
now we have used only (|5.5|) . not the stronger assumption (|5.1(l (b). 

For our fourth and final verification, it suffices [again by invoking (|5.19jl ] to establish 
the summability of 

E 1 141 
kP 



n 



2p-4 



.k=l 



(5.22) 



for any real p < 3/2. To do this, we break the sum into ^2 k<n s and J2n s <k<n an( ^ once 
again invoke (|5.19|) . In the range Ylk<n s we simply use t k = O [Vkj and note 



n 



2p-4 



fc<n a 



3-2p 



0(n T ) 



with r < — 1. In the range E^ n 5 <fc<n we use Cauchy-Schwarz again: 



n 



2p-4 



E \tk\ 
kP 

n 6 <k<n 



n 2 '- l n 3 - 2 ' 



E 

n s <k<n 



(3/2)-p 



fc(l/2)-p (2 



n s <k<n 



n^/ 2 )-P k 



-£ = O n 1 max -r 



n s <k<n k 



which is summable by assumption (|5.1|) (b). 

Now we proceed to our "borderline small" case. 



□ 
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Proof of CLT II (Theorem \5.4\ )- Again we use the method of moments together with 
asymptotic transfer results. Given the similarity to the proof of CLT I, we will be brief 
here. 

Equation Q5.9JI simply repeats the asymptotic transfer result l|4.6jl . As before, (a 2 ) 
satisfies the recurrence (|2.3jl with (b n ) replaced by (r n ) of (|5.17|l - (|5.18|) . where again 
H := Ki/(H m — l) and rj := for < j < m—2. Here the proofs diverge somewhat. The 
analogue of Lemma 15.61 is Lemma 15.71 below. Then the asymptotic variance assertions 
of CLT II follow immediately from Theorem I4,5f c). 



if e c 



< oo, then from (|4.1()|) applied to I? it follows that (|5.12j) satis- 



fies (|5.13|) - ()5.14|) for k = 1. Then higher moments are treated exactly as in the proof 
of CLT I to complete the proof of CLT II. 



oo L 2 (k) 



oo, then one uses (|4.1U|) . Theorem 14. 5( d). and induction to show that 



the moments (j5.12|) satisfy 



fin{2k) 



2 k k\ 1 h 



k > 1 



(5.23) 



and 



ji n {2k - 1) = o (s 



,2fc-l, 



n , 



k > 1 



(5.24) 

and thereby complete the proof of CLT II. We omit the details. □ 

The following cousin to Lemma 15.61 was used in the proof of Theorem 15.41 
Lemma 5.7. In the context of CLT II, the sequence (r n ) defined for n > m — 1 by 

t -f /% -\ h Hj n ~ fJ>n\ 2 

J 

satisfies 



Vm-l/ 



r n ~ (H m - l)a 2 nL 2 (n). 
Proof. By jOJ), with 9 := (3/2)™ rT /(m! - (3/2)"^), we have 

l — Y, [n 1/2 L{n) - Oj^Hn) ejU 2 LUm) + On^lA 



(m-l) 



nL 2 (n) (m — 1)! 



n 



(1 + 



m— 1 

'E 

i=l 



1/2 



m-j \V 2 ' 



dxi ■ --dXm-1. 



where the integral, call it J, is over {x\, . . . ,x m ~\) G [0, l] m_1 with sum not exceeding 
unity. To complete the proof we need only show J = (H m — l)a 2 /(m — 1)!, with a 2 
defined at (15.111). 



19 



Indeed, 



J 



m—l 



m—l 



8=1 

m—l 



i=l 



20(1 + 0) J>, 1/2 - 20(1 + 0) (l-^ 



m—l 



1/2 



i=l 



\ i=l / 
1 /2 

m—l \ ' m—l 



\ i=l 

1 



.1 ; 



i=i 



1/2 



[(l + 0) 2 + 21 



[(1 + ^)2 + 021 



(m - 1)! 
1 

(m- 1)! 



20(1 + fl)m r ( 3 / 2 ^ + 6l 2 m(m - 1) 



cZxi • • • dx m -i 

T(3/2)] 2 



r(m+(l/2)) 



r(m + 1) 



20(1 + 0)m 



1 



+ 



vr/4 



(3/2)™" 1 (m-2)!' 



Plugging in the value of and simplifying, we obtain J = (H m — l)a 2 /(m — 1)!, as 
desired. 

□ 



5.3 Periodicity for m > 27 

If t n = o (\/n) as in CLT I but m > 27, then the remainder term jl n := \i n — \x{n + 1) 
for the mean — which by (j5,2[) was o(y / n) when m < 26 — now satisfies, by the ETT 
and [compare 

An = C 2 t ,-_ , + C 3 



r(A 2 ) °r(A 3 

m—l 



i / \ 

+m! E ^(Xo n v (1 ~ " r ' J Jo f (c)(1 " c)Aj_1 7 

+o(n ReA4 " 1 ) . (5.25) 



Typically this will lead to the negative result that (/2 n ) [and hence also (r n ) and 
(<r 2 )] suffers from periodicity and that there is no natural distributional limit for nor- 
malized X n . Examples are the space requirement functional studied by Chern and 
Hwang [Sj (and others before them ^H]) and the shape functional 9 . For recent devel- 
opments in the case m > 26 for the space requirement see [I]. 

But it is perhaps difficult to establish a general negative result, due to cancellations. 
For example, suppose T(z) equals (1 — z)^ 1 , so that t n = 1, as studied by Chern and 
Hwang except perhaps that the initial values to, ... , t m -2 are changed. Then 

T^ m - X \z) = (m- 1)!(1 - z)- m , 
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whence 

oo m—1 m—1 \— A pz 

A(z) = ^ » n z n = - *T Xj + (™ - 1)' E ,m / / C 1 " ^ Aj '" 2 dC 

r,=1 ,=1 fe1 ™ V^jJ JO 



n=l 3=1 j=l 

Now it is possible to choose to, . . . , t m _2 so that 

(m- 1)! 



(l-W(Ai) : 



l,...,m-l. (5.26) 



In that case A(z) = —^^-(1 — z)~ c , whence jl n = fi n = — l/(m — 1) [and we see that 
the chosen values of to, . . . ,t m -2 are all — l/(m — 1)], and we get linear variance and 
asymptotic normality, just as in CLT I, for every 2 < m < oo. 

One might object that the above example is artificial, in that the toll sequence 
changes sign. But the same calculation show that if the toll sequence is chosen as above 
(t n = 1) but with inital values 

tj := K(j + 1) - 0<j<m-2, 

then still, for every m > 2, the sequence (jl n ) is constant, the variance is linear, and 
we have asymptotic normality. Further, (t n ) is nonnegative provided K > l/(m — 1). 
[We remark in passing that the choice K = l/(m — 1) leads to the degenerate case 
of Remark 15.21 ] The sequence (t n ) is also nondecreasing (as in most real examples) 
provided K < m/(m — l) 2 . 



6 Moderate and large toll functions 

In order to describe the limiting distribution of X n for moderate and large tolls, we 
will introduce a family of random variables Y = Y{(5) defined for j3 > 1/2, (3^1. 
Anticipating Lemma 16.11 we need to consider the distributional equation 

m 

y = ^s^ + i (6.i) 

3=1 

Here (5^)™ i are independent copies of Y and (Si, . . . , S m ) is uniformly distributed on 
the (m — l)-simplex, independent of (Yj) 1 JL 1 . Recall that the (m — l)-simplex is the set 

{(si, . . . , s m ) '■ Sj > for 1 < j < m and s + = 1}, 

where s + denotes X)j=i s j- 

Let C/(!) , . . . , C/( m _i) be the order statistics of a sample of size m—1 from the uniform 
distribution on (0, 1). They have joint density 

fu m ,...,u (m _ 1) {xi, ■ ■ ■ ,x m ^i) = (m- 1)!1(0 <xi < • •• < x m -i < 1) 
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with respect to Lebesgue measure on M m_1 , where 1(A) is the indicator of A. 

By a change of variables, we find that the joint distribution of the spacings 
Si,... , S m , defined, with U( \ := and Ur m \ := 1, by 

Si ■= - J7 ( j_!), i = 1, ...,m, 

is uniform over the (m — l)-simplex: 

fsi,...,s m -i{si, ■ • -,s m -i) = (m- l)M{sj > 0,j = 1,... ,m- 1; s + < 1) . 

When rj > — 1 for 1 < j < m, observe that 



E 



IK 



j 



- (m - 1)! / i>Q S ? • • • C-i (1 " s + ) rm • • -dsi 

J ' s' + <l 

= : (m — l)!i?(ri + 1, . . . , r m + 1) (defining B as the integral) (6.2) 

nr=i rfo + 1) 



(m-iy. 



r(ri H h r m + m) 



Lemma 6.1. Fix (3 > 1/2 wrai/i /3 7^ 1. T/ien i/iere exists a unique distribution C(Y) = 
C(Y((3)) with finite second moment satisfying the distributional identity (|6.1|) . 

Proof. We first observe the that mean of any such distribution is determined by ()6.1|) , 
Indeed, by taking expectations in (jfi.lj) and using (|fi.2|) . we get 



r(/3 + m) 

since (3^1. Thus we can equivalently consider the distributional identity 



m 

C 



W = Yl S j W i + ^ 

5=1 

where 

m 

tf:=l- M + ^Sf. 

i=i 

Here W is restricted to have mean and finite second moment, (W^JL-^ are indepen- 
dent copies of W, and (Si,... ,S m ) is uniformly distributed on the (m — l)-simplex, 
independent of (Wj)™^. 

We now employ a standard contraction-method argument [231 [2D • Let d 2 denote the 
metric on A^2(0), the space of probability distributions with mean and finite variance, 
defined by 

d2(Gi,G 2 ) := min||X 2 - Xi|| 2 , 

taking the minimum over all pairs of random variables Xi and X 2 defined on a common 
probability space with C(X\) = G\ and C(X 2 ) = G 2 . Here ||-||2 denotes L2-norm. 
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Let T be the map 

(m 
S^Xj + H 

where (Xj)^L 1 are independent with C(Xj) = G, j = l,...,m, and (Si,...,S m ) is 
uniformly distributed on the (m — l)-simplex, independent of (Xj)™ =1 . We show that 
T is a contraction on ^2(0); more precisely, that there exists a p < 1 such that 

d 2 (T(C(A)),T(C(B))) < pd 2 {C{A),C{B)) 

for all pairs £(A) and C{B) in M 2 (0). To bound d 2 (r(£(yl)), T(C{B))), we couple 
T(£(.A)) and T(C(B)) by taking to independent copies (Aj,Bj) of the ^-optimally 
coupled (A, B), an independent (Si, . . . , S m ), and defining 



A' := S j A 3 + H ~ T (£( A )), B ' ■= Yl S J B i + H ~ T ( £ ( jB ))- 
3=1 j'=i 

Now, defining S := (Si, . . . , S m ) and using the law of total variance, 
d 2 {C{T(A)lC(T{B))f 



< \\B' -A'\\l 



3=1 



Var 



Y J S P j {B j -A j 

3=1 



EVar 



3=1 



+ VarE 



J2S^(B j -A j 

3=1 



£>Sf )Var [5, - ^] = d 2 (£(A),£(i?)) 2 ^ESf 
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3=1 



We need only verify that 



2 ,r(2/3 + i 

p := to! 



???! 



T(2f3 + m) (2/3 + m- 1) • • • (2/3 + 1) 



T(2/3 + l) 
r(2/3 + m 



< 1, 



■d 2 (£(^),/:(i?)) 2 



which is true when /? > 1/2. The existence and uniqueness of C(Y) now follows from 
the Banach fixed point theorem |24l Theorem 2]. □ 

6.1 Moderate toll functions 

In the case of moderate toll functions, convergence in distribution and convergence of 
all moments can be stated as 

Theorem 6.2. If the toll sequence (t n ) satisfies 

t n ~ •nPhin) with 1/2 < (3 < 1, 
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where L is a slowly varying function and a < 1 + /3, then the mean of the correspond- 
ing additive functional X n on m-ary search trees with the random permutation model 
satisfies 



u n = iin n^L(n) + o(n^L(n)), u := , (6.3) 

urei/i iTi defined at (|2.12|) . Moreover, 

X n - [in c 



with convergence of all moments. 



Remark 6.3. It is well known that a < 3/2 for m < 26. In Theorem IB .71 we show that 
a increases with m. Thus for a fixed (3 € (1/2, 1), the condition a < 1 + /3 is equivalent 
to m < mo for some mo > 26. 

Proof of Theorem \6.°A We use the notation introduced in the proof of Theorem 15.11 
in Section 15.21 Equation (|6.3|) is simply a restatement of the asymptotic transfer re- 
sult 

We show that the moments fl n (k) satisfy 

^(jfe) = g k n kp L k {n) + o{n kp L k {n)) as n -» oo. (6.4) 
The claim holds for k = 1 with 

51 : " m !-(l + /3)— " V r(/3 + m) J • (6 ' 5) 
Using l|5.1fij) . by induction we get, for k > 2, 
r n (jfc) = o(n fc/3 L fc (n)) 

+ fc * » +1 )(»' J i(»))' 

X E ^ Of ^Ol))* 1 • • • Sfc«Cto7m))*" 

Vm-17 j 

= o(n fc/3 L fc (n)) 

, . . . , fv m , t m +iy 



lm-1 

But [recall the definition of 5 at (|6.2j) 
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so that 

r n (k) = o(n k ^L k (n)) 

+ n^L k (n)(m-iy.y* ( \ ) g kl ■ ■ ■ gkm B(ktf + 1, . . . , k m /3 + 1). 

\Ki, . . . , K m , Km+xJ 

Using Theorem I4.5( with v = k/3 > 1, we get 

+ " t ^ t W I '"^U E* (,,,..., £„, w )* • • • + 1, ■ ■ ■ , w + 1). 

1 r(fc/3+m) k V ' m ' m+i/ 

Thus, defining g k recursively as 

g* = , { ZnkLi) E* L \ W-- fffem £(^ + i,...,Aw3 + i), (6.6) 

1 r(fc/M-m) k v ' m+1/ 

with <7o = 1) we see that (|6.4(l holds for all k > 0. 

By Lemma |6.4I ftp follow) and the method of moments (cf., e.g., [7[ Sections 4.4 
and 4.5]), the g^s are the moments of a uniquely determined distribution, say C(Y), 
and 

X n - [in c 



Y 



with convergence of all moments. It remains to show that Y = Y{(3). 
Define 

m 

Y:=J2S?Y j + l, (6.7) 
j'=i 

where (Y^)^! =1 are independent copies of Y and (Si, . . . , S m ) is uniformly distributed 
on the (m — l)-simplex, independent of (Yj)J! =1 . We will show that Y = Y, and then, 
by (|6.7|) . £(Y) satisfies the distributional identity ()6.1j) and has finite second moment. 
By Lemma l6~Tl Y = Y(/3), as desired. 

^ £ ~ ^ ~ 

To show Y = Y, it suffices to show that Y and Y have the same moments. Letting 
^ k denote (as before) the sum over (m+l)-tuples (k±, . . . , k m+ i) of nonnegative integers 
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summing to k, and using (|6.7j) . (|6.2[) . and (|6.6f) . 



Ey ^ Gi,---^m+i) 



E 



/ci . . . . . k 



(m - ly.Bihp + l,...k m /3 + l)g kl ■ ■ -g kn 



(m-1)!^ 



k\, ■ ■ ■ , AVra+l 



B(fci/3 + 1, . . . + l)g kl ■ ■ ■ g kn 



+ m\B(k/3+ 1,1,..., l)g k 



1 



m\T(k(3 + l) 
T{k(5 + m) 



m\T(k(3 + 1) _ - fc 



where (as before) denotes the same sum as ^ k with the additional restriction that 
ki < k for i = 1, . . . , m. □ 

Lemma 6.4. The moments {g k ) uniquely determine the distribution C(Y). 

Proof. Define 7& := g k jk\. It suffices to show (by Carleman's condition) that there 
exists an M such that j k < M k for all k sufficiently large. We proceed by induction. 
Indeed, by ()6.6|) we know 



Ik 



(m - 1)! 



TTE*r^TT (fllkA B(h/3 + l,...,k m + l) 



1 T{kl3+m) k 



< M 



u (m—l)\ s-^* M~ km+1 



r(fc/3+m) k 

by the induction hypothesis. So it is certainly sufficient to show that 
n-^* M~ km+1 

E — — r B{k x (3 + i,...,k m l3 + i) 



k m+ il 



]\,f-k m+1 



E -T r r((A;-A :m+ i)/? + ^)" 1 E n r ^ +1 ) 



0<ki,...,k m <k j=l 



km + l=0 

ki H \~k m = k k m -^-± 

^ as A; — > oo. (6. 
For this, fix a value of fc m +i S {0, 1, 2, . . .}, and consider the sum 



e ri r (^ +i )- 



(6.9) 



0<&i,...,£; m <£; j=l 



By log-convexity of V ^ 6.4.1], taking / to be (0, oo) and g to be logT in Proposi- 
tion 3.C.1 of j20|, the logarithm of the product in ()6.9j) is Schur-convex on (0, oo) m . 
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Thus applying Proposition 5.C.2 of |2U| with m = there, the biggest terms in the sum 
correspond to kj = k — k m+ \ for j equal to some jo and kj = otherwise; together, 
these m terms contribute mT{{k — k m+ i)(3 + 1) to the sum. If k > k m+ i, there are 
other terms, the biggest of which corresponds to having one of the kj's be k — — 1, 
one be 1, and the rest be 0. (This follows from Proposition 5.C.1 of [201 with m = 
and M = k — k m+ i — 1 there.) The total number of terms in the sum ()6.9I) is at most 
{ m ~ 1+ m-i m+1 ^) • S° the remaining contribution to (|6.9|) is at most 

m " 1 + {k : km+l) ) m - k m+1 - i)p + i)r(/3 + 1). 

m — 1 J 
We have found that the left side of (|6.8|) is bounded by 



V -l(k rn+1 < k) f(k - k rn+1 ), 

k m+1 =0 



where 



" (^ + i)-(^ + m-l) + r(/? + 1} (fU0) 

r -y )r(fc/3+1) l ( fc + l)--(fc + (m-l)) 

r(£:/3 + m ) " (m-1)! + 1) • • • (A;/? + (m - 1))' { ' ' 

which is a bounded function of k. To apply the dominated convergence theorem, it 
suffices to show that the right side of (|6,1()|) tends to as k — > oo, which follows from 
Stirling's approximation and the fact that (3 > 0. □ 

When t n satisfies the conditions in Theorem 16 . 21 but a > 1 + /3 then [compare (|5.25|) ]. 

fin = C2 „,, r + C 3 



r(A 2 ) °r(A 



+ ml E ^x~) [«"] ((! - z )~ X] [ f (0(1 - o^- 1 dc) 



+ o(n /3 ) + 0( n Rc ( A4 - 1 )), 



and typically this leads to periodicity. 



6.2 Large toll functions 

If t n ~ n^L(n), where > 1 and L is slowly varying function, then we have convergence 
in distribution for all values of m. We state the result, omitting the proof, as it is very 
similar to that of Theorem 16.21 

Theorem 6.5. // the toll sequence (t n ) satisfies 

t n ~ n^L(n), with j3 > 1, 
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where L is a slowly varying function, then 

nPL{n) ^ (P> 

with convergence of all moments, where C(Y(f3)) is the unique distribution satisfy- 
ing (|6.1|) . 

Presumably, the borderline case t n ~ nL(n) where L is a slowly varying function can 
also be handled using the techniques of this paper, but we have not pursued this. The 
specific choice t n = n — (m — 1) for n > m — 1 corresponds to the well-studied total path 
length of a random m-ary search tree. The corresponding additive functional measures 
the number of basic operations in m-ary Quicksort. As is well known, the number of 
basic operations has mean 0(nlogn) and standard deviation 0(n). See |13| for details 
in this case and |22| Corollary 5.2] for a characterization of the limiting distribution of 
the path length. 

A Solution of an Euler differential equation 

We now provide the proof of equation (|2.6|) in Theorem 12.21 which states the general 
solution of the differential equation (|2.4ft with initial conditions aj = bj, < j < m — 2. 
This linear differential equation can be written in the form 

Lg = h (A.l) 

where the operator L is defined as 

(Lg)(z) := g^ m ~ l \z) - m!(l - z)-^g(z). (A.2) 

We seek the solution g = A corresponding to input h = B^ m ^ 1 \ 

Equations of the form ()A.l|) - (jA.2|) are members of a class known as Euler differential 
equations. In this appendix we discuss a general method for solving Euler equations, 
restricting attention, for the sake of definiteness and practicality, to (|A.1|) - (|A.2|) . We 
assume that the reader is familiar with the theory of linear differential equations with 
constant coefficients (see, e.g., [S]). 

For brevity we have omitted several routine proofs. A fuller version of Appendices lAl 
and IB.2| mav be found in the technical report 

The homogeneous solution 

The technique for solving Lg = is quite easily summarized: make the change of 
variable z = 1 — e~ x , that is, x = ln((l — z) -1 ). For notational convenience we will 
abbreviate ln((l — z)" 1 ) as L(z). Then consider the function g defined by 



g{x) := g(l - e x ), i.e., g(z) = g{L(z)). 



(A.3) 
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Lemma A.l. The derivatives of g are related to those of g by 

k 



3=0 



(A.4) 



where [^] denotes a signless Stirling number of the first kind. 

Proof. The proof is a straightforward induction on k using standard identities for the 
Stirling numbers |15[ § 1.2.6]. □ 

The left-hand side of (TOT) can hence be expressed as 



m—1 



(Lg)(z) = (1 - z)~^ { £ 

3=0 



m — 1 
3 



~g^{L{z))-m\~g{L{z)) 



so that solving ~Lg = is equivalent to solving ~Lg = 0, where 

m—1 



3=0 



m — 1 

3 



g^\x) — m\g{x). 



(A.5) 



But this is a linear differential equation with constant coefficients. Its indicial polyno- 
mial, or characteristic polynomial, is 



m—1 



m—1 
. 3 . 



\i - m \ = A m_1 



mi, 



(A.6) 



^ m (X) = ij)(X) := Y, 
3=0 

the last equality following from |15l 1.2.6-(44)]. For more on this polynomial see Ap- 
pendix El From the discussion in ^7j it follows that there are m — 1 distinct (and 
nonzero) roots of if), call them Ai,...,A TO _j arranged in nonincreasing order of real 
parts. Thus the functions exp (\jx) are m — 1 linearly independent solutions of QA.5j) 
and hence the functions (1 — z)~ Xj form a basis of solutions to Lg = 0. 



A particular solution 

Lemma A. 2. The particular solution to hg 
( through order m — 2) is 

m—1 \ X . 



h with vanishing initial conditions 



9 P ( Z ) = E 



3=1 



iP'(Xj) 



MC)(i - c) Aj+m " 2 d(. 



The initial conditions 

Having computed a basis of solutions to the homogeneous equation and a particular 
solution to the inhomogeneous equation, so far we have established equation (|2.6fl in 
Theorem |2 i 2| modulo determination of the coefficients c\, . . . , c m _i at (EH) . 

The fact that the initial conditions for g p vanish make it simple to solve Lg = h 
for specified initial conditions: One need only match up the initial conditions of the 
homogeneous solutions. 
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Proposition A. 3. If the general complementary solution to L<? = h is written in the 
form 



m—1 



3=1 



then the constants Aj are given by 

, m-2 (k) ( m 

B Properties of the indicial polynomial 

The indicial polynomial 

^(A) = ^ m (A) :=A^-m! (B.l) 

plays an important role in the analysis of random m-ary search trees. We will enumerate 
a few useful identities involving the polynomial in this appendix. 

It is well known ^1 Chapter 3] that ip m has m — 1 distinct roots 2 = Ai, A2, • • • , A m _i 
listed in nonincreasing order of real part. As in |17l Chapter 3] we introduce 

a = a m := max Re(A 3 ); 
2<j<m— 1 

that is, a is the second largest real part among all the roots of the indicial polyno- 
mial ()B.1|) . We list some important properties of the roots of (|B.1|) stated in [T3 §3.3]: 

(i) The number — m is a root if and only if m is odd. All other roots of ^(A) are 
simple, non-real roots. 

(ii) No two roots have the same real part unless they are mutually conjugate. (This 
follows from the strict increasingness of |(s + it) m_1 | in \t\.) 

B.l Monotonicity of a in m 

One can check easily that for 3 < m < 6, a is strictly increasing in m. We will now prove 
this fact (Theorem IB .7(1 for all m > 3. To do so we build upon ideas in Appendix A 
of QH. 

Claim B.l. For any m > 6 and —00 < x < 2, 

g m (x) := inf{y > : (2 — x + iy){2> — x + iy) . . . (m — x + iy) is positive real} (B.2) 
is positive and finite, and the infimum is achieved. 
Proof. If z = 2 — x + iy with —00 < x < 2 and y > 0, then 

m m 

y 



> arg(j — x + iy) = > arctan ■ 
^— ' 7 — x 
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This last expression is strictly increasing in y > 0, with value at and (for to > 6) 
limit 

m _ 

E7T , .7T Of 
- = ( m - 1)- > 
2 2 ~ 2 

3=2 

as y — > oo. It is therefore clear that g m (x) is positive and finite and in fact is charac- 
terized as the unique root to 

y arctan — = 2ir. 

3=2 J 

The set on the right in l)B,2|) is discrete, and g m (x) is its smallest element. □ 
Claim B.2. The function g m in (|B.2|) is (form > 6) strictly decreasing in — oo < x < 2. 
Proof. If — oo < x\ < X2 < 2, then 

2-7T = arctan ^ < arctan ® m - 1 - , 
7 — X] ^-^ j — Xo 

3=2 J 1 j=2 J 1 

so g m (x 2 ) < g m {x-i). □ 

Claim B.3. The function g m in (|B.2|) is (for to > 6) continuous in — oo < x < 2. 

Proof. Let — oo < xi < X2 < 2. In Claim lB~2l we have seen g m {x2) < gm(xi). To 
complete the proof of Claim IB,3| we will show that 

g m {x 2 ) > - — —g m (xi). (B.3) 
2 — xi 

Indeed, 

m 

g m {xi 



2-/r = > arctan — = > arctan x — > > arctan 

Z J n nr-. Z J n nf, 1 ^ ^ 



J " Xl 3=2 



frtffm(xi) 



j=2 J 1 



J — Xl j -x 2 , 

so (IB~3l) follows. □ 
Corollary B.4. For an?/ m > 6 and — oo < x < 2, define 

fm{x) := (2 - x + i# m (x))(3 - x + ig m {x)) ... (to - x + ig m (x)), 
which by Claim liOl is positive real-valued. Then f m is continuous. 
Proof. This is immediate from Claim IB~3l □ 
Claim B.5. For fixed — oo < x < 2, g m {x) is strictly decreasing in to > 6. 
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Proof. If m > 6, then 

m / n m+l / \ 

E 5m ( X J ^ — > Qm \ ) 
arctan < > arctan , 

j — x j — x 

i=2 J j=2 J 

so g m+ i{x) < g m {x). □ 

Lemma B.6. For m > 4, /ei A m = a m + i/? m (with (3 m > 0) be a root of the indicial 
polynomial ip m with second largest real part. Then |A m + to — 1| <m+l. 



Proof. If \X m + m — 1| > m + 1, then by the triangle inequality | X m + j\ > j + 2 for all 
j = 0, . . . , m — 2. But then |A™ _1 | = nj^o 2 \\n + j| > m 'i which is a contradiction. □ 

Combining our preliminary results we can now prove the asserted monotonicity of a. 



Theorem B.7. The second largest real part, a m , among roots of the indicial polyno- 
mial ij) m is strictly increasing in m > 3. 

Proof. For m < 6 the result is easily verified. We now show that a m -fi > a m for m > 6. 
Observe / m+ i(0) = |/ w+ i(0)| > \(m + 1)!| = (m + 1)! and 

/m+l (2 — a m ) = |/m+i(2 — a m )| 

= | [«m + «5m+i(2 - a m )] . . . [a m + (m - 2) + ig m+ i(2 - a m )\ 
x [a + (m - 1) + «5 TO+ i(2 - a m )]| 

< | [a m + ig m (2 - a m )] . . . [a m + (m - 2) + ig m {2 - a m )} (B.4) 

x [a m + (m - 1) + «5 m (2 - a m )]| 

< m\\a m + (m - 1) + «/3 m | (B.5) 
= m!|A m + m — 1| < (m + 1)!. 

Inequality (|B.4|) follows from Claim lB~5l inequality (|B.5|) holds since g m (2 — a m ) < /3 m , 
and the last inequality is a consequence of Lemma IB. 61 Therefore, by Corollary IB.41 
f m +i(x) = (m + 1)! for some < x < 2 — a m , and so A m = (m + 1)! for some A with 
Re A > a m . That is, a m+ i > a m , as desired. □ 

Remark B.8 (Asymptotics of a m ). Using (B.7) and (B.8) from ^H] and the characteri- 
zation of g m ( x ) in Claim |B~T1 we can establish 

a m = 2 - (1 + o(l)) 2vr 2 ^ - 1^ hr 3 m, 

P m = (l + o(l))2 7 rln- 1 m; 

we omit the proof. Thus the Proposition in Appendix B of ^H] is asymptotically optimal, 
to first order. 
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B.2 Identities involving the indicial polynomial 

Identity B.9. When A ^ {Ax, • • • , \ m -x}, 

m-i 1 1 

(r) 

For r and n positive integers, let H\ ' denote the rth-order harmonic number 

- 1 
H (r) : =Y-. 

3=1 

When r = 1 we will use H n := Hn^ for the usual (lst-order) harmonic number. 
Identity B.10. For < k < m - 3, 

m—1 \k 

Identity B.ll. 

^'(2)=m!(tf m -l) and ^"(2) = m\[{H m - I) 2 - (H$ - 1)]. 
Identity B.12. 



m-i 1 1 

^ (A,- - 2W(\ 3 ) = 2^X) 



1 



_ 1 

±±rn, -L 



(H m - 1) 2 
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